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The nucleotide sequence between the peplomer and matrix protein genes in the 
genome of the Purdue strain of porcine transmissible gastroenteritis coronavirus 
(TGEV) was determined by sequencing parts of six cDNA clones. Open reading 
frames potentially encoding proteins of 7,711,27,711, and 9,241 Da were identified 
(Fig. 1). The sequence for this region of the genome for the same strain of virus was 
published by Rasschaert et al. (3), but our sequence differs by two bases, one of 
which results in a major change in the properties of the second open reading 
frame (ORF). G in our sequence at position 433, rather than T, enlarges the second 
open reading frame from 165 to 244 amino acids and establishes a sequence con¬ 
text more favorable for initiation of translation (4). C in our sequence at position 
606, rather than T, changes Leu to Pro. Each open reading frame is preceded by a 
sequence that is similar to the CTAAAC intergenic sequence thought to be re¬ 
quired for leader-rimed transciption (1,3). The enlargement of the second open 
reading frame from 165 amino acids (18,833 Da) to 244 amino acids (27,711 Da) 
resolves two concerns raised by Rasschaert et al.(3) namely a) the second ORF in 
their sequence would require initiation of translation 570 bases downstream from 
the CTAAAC intergenic sequence (or 249 bases downstream of the CTAAAT se¬ 
quence that we propose is used), which is an unusually long distance, b) The sec¬ 
ond ORF in their sequence is not large enough to encode the 24 kD polypeptide 
translated in vitro from TGEV mRNA 3 by Jacobs et al. (5), an mRNA 
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1 TAAATTTAAAATGTTAATTCTATCATCTGCTATAATAGCAGTTGTTTCTGCTAGAGAATTTTGTTAAGGATGATGAATAAAGTCTTTAAGAACTAAACTT 
101 ACGAGT CATT ACAGGT CC T GT ATGGgCAJT GTCA^AT^CAJ T TACA^AT gCGTAGgT GjJTGTAC TTGACGAACT TGAT TGTGCATACT T TGCTGTAA^T C 

201 TTAAAGTAGAATTTAAGAiJTGGTAAATTACTTGTGTgTATAGgTTTTGgTGACA^ACTTCTTGJTGjjTAAGGgTAAAGCATATGCTAAGCTTGGTCTCTC 

301 CATTATTGAAGAAGTCAATAGTCATATAGTTGTTTAATATCATTAAACACACAAAACCCAAAGCATTAAGTGTTACAAAACAATTAAAGAGAGATTATAG 


I I E E V N 


H I V V 


401 AAAAACTGTCATT CTAAAT TCCATGCGAAAATGAjTGGTGGACTTTJTCTTAGTA^TCTGAgTTTTGTAATTGTTAgTAACCATTQTATTGTTAATAACA 
501 ^AGjJAAATGTGCATCATA]ACAACAAGAAC^TGTTAJAGTACAACAGCATCAGGTTGTTA^TGjTAgAA^ACAAAACTATTACCCAGAGTTCA|CATCGC 
601 TGTACgCTTTGTAT^TTTTCTAGCTTTGTACCJTAgTA^AAACTTTAAGAJGTgTGTCGgCAJCTTAATGTTTAAGATTTTATgAATGACACTTTTAGgA 
701 C^TATGCTTATAGCAT^TGgTTACTACATTGATGgCATTGTTA^AAJTGTCTTAT|TTTAAGATJTGTCTACTTAGJATACTpTGGT^TGTTAATA 
801 GTAgGTTTG|ATJTAJTTTATACAATA^AApA^ACTCATGTJTGTACATGgCAjAGJTGJAC^GTTTATGAGAAGTTpCACA|CT|TATTTATGTCAC 
901 ATTGT^TGgTGgCATAAATTATATGTJTGTGAATGJCCTCA^GTTGCATTJTGTAGACCJTATGCTTGTAAgCATAGJAATACGTGGCTTAGCTCATGCT 
1001 GATCTAA^TGTACTTAgAGJAGTTGAACTTCTCAATGgTGATTJTAJTT^TGTATTTT|ACAGGAGC^CGTAGTCGgTGTTTACAATGjAGjCTTTTCTC 
1101 AGGjJGGTTCTAAACGAAAT T GACT TAAAAGAAGAAGAAGAAGACCATA^CT AT GACGTT T £CTAGGGCAT T GAC TGTCATAGAT GACAAT GGAATGGTCA 


MTFPRALTVI DDNGHV 


1201 TTAACATCATTTTCTGGTTCCTGTTGATAAJTATATTGAJATTACTTTgAATAGCATTGCTAAATATAATTAAGCTAT^CATGGTGTGTTGCAATTTAGG 

1301 AAGGA^AGTTATTAJTGTTC£AGCGCAACATGCTTACGATG£CTATAAGAATTTTATGCGAATTAAAGCATACAACCCCGATGGAGCACTCCTTGCTTGA 

1401 A CTAAAC AAAATG 

M 


Fig. 1. Nucleotide sequence between the peplomer and matrix protein genes of TGEV and the deduced 
amino acid sequences for the three large open reading frames. The nucleotide sequence begins with the 
TAA stop codon of the peplomer gene (1, and unpublished data) and ends with the ATG start codon of 
the matrix protein (2). The consensus intergenic sequences are underlined. 


that maps in this region of the genome. The base differences we report were ob¬ 
tained from two separate cDNA clones. 
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